
United States Patent and Trademark Office 




UNITED STATES DEPARTMENT OF COMMERCE 
United States Patent and Trademark Office 
Address: COMMISSIONER FOR PATENTS 
P.O. Box 1450 
Alexandria, Virginia 22313-1450 
www.uspto.gov 



| APPLICATION NO. | FILING DATE | FIRST NAMED INVENTOR | ATTORNEY DOCKET NO. | CONFIRMATION NO. | 
| 10/733,541 | 12/11/2003 | Yunbo Cao | M61.12-0556 | 7738 | 



27366 7590 11/13/2006 

WESTMAN CHAMPLIN (MICROSOFT CORPORATION) 
SUITE 1400 
900 SECOND AVENUE SOUTH 
MINNEAPOLIS, MN 55402-3319 



| EXAMINER | ART UNIT | PAPER NUMBER | 
| LY, CHEYNE D | 2168 | | 

DATE MAILED: 11/13/2006 



Please find below and/or attached an Office communication concerning this application or proceeding. 



PTO-90C (Rev. 10/03) 



Office Action Summary 

| Application No. | Applicant(s) | Examiner | Art Unit | 
| 10/733,541 | CAO ET AL. | Cheyne D. Ly | 2168 | 
Period for Reply 



A SHORTENED STATUTORY PERIOD FOR REPLY IS SET TO EXPIRE 3 MONTH(S) OR THIRTY (30) DAYS, 
WHICHEVER IS LONGER, FROM THE MAILING DATE OF THIS COMMUNICATION. 

- Extensions of time may be available under the provisions of 37 CFR 1.136(a). In no event, however, may a reply be timely filed 
after SIX (6) MONTHS from the mailing date of this communication. 

- If NO period for reply is specified above, the maximum statutory period will apply and will expire SIX (6) MONTHS from the mailing date of this communication. 

- Failure to reply within the set or extended period for reply will, by statute, cause the application to become ABANDONED (35 U.S.C. § 133). 
Any reply received by the Office later than three months after the mailing date of this communication, even if timely filed, may reduce any 
earned patent term adjustment. See 37 CFR 1.704(b). 

Status 

1) ☒ Responsive to communication(s) filed on 01 September 2006. 
2a) ☒ This action is FINAL. 2b) □ This action is non-final. 

3) □ Since this application is in condition for allowance except for formal matters, prosecution as to the merits is 
closed in accordance with the practice under Ex parte Quayle, 1935 C.D. 11, 453 O.G. 213. 

Disposition of Claims 

4) ☒ Claim(s) 1-8 and 25-34 is/are pending in the application. 

4a) Of the above claim(s) ____ is/are withdrawn from consideration. 

5) □ Claim(s) ____ is/are allowed. 

6) ☒ Claim(s) 1-8 and 25-34 is/are rejected. 

7) □ Claim(s) ____ is/are objected to. 

8) □ Claim(s) ____ are subject to restriction and/or election requirement. 

Application Papers 

9) □ The specification is objected to by the Examiner. 

10) □ The drawing(s) filed on ____ is/are: a) □ accepted or b) □ objected to by the Examiner. 

Applicant may not request that any objection to the drawing(s) be held in abeyance. See 37 CFR 1.85(a). 
Replacement drawing sheet(s) including the correction is required if the drawing(s) is objected to. See 37 CFR 1.121(d). 

11) □ The oath or declaration is objected to by the Examiner. Note the attached Office Action or form PTO-152. 

Priority under 35 U.S.C. § 119 

12) □ Acknowledgment is made of a claim for foreign priority under 35 U.S.C. § 119(a)-(d) or (f). 
a) □ All b) □ Some * c) □ None of: 

1. □ Certified copies of the priority documents have been received. 

2. □ Certified copies of the priority documents have been received in Application No. ____. 

3. □ Copies of the certified copies of the priority documents have been received in this National Stage 
application from the International Bureau (PCT Rule 17.2(a)). 
* See the attached detailed Office action for a list of the certified copies not received. 
DETAILED ACTION 

1. Applicants' arguments filed September 01, 2006 have been fully considered but they are 
not deemed to be persuasive. Rejections and/or objections not reiterated from previous office 
actions are hereby withdrawn. The following rejections and/or objections are either 
reiterated, or newly applied as necessitated by claim amendments. They constitute the 
complete set presently being applied to the instant application. 

2. The 35 USC 101 and 103 rejections are hereby withdrawn as necessitated by claim 
amendments. Further, Applicant's arguments directed to withdrawn rejections are moot. 

3. The addition of claims 25-34 and cancellation of claims 9-24 have been acknowledged. 

4. Claims 1-8 and 25-34 are examined on the merits. 

OBJECTIONS 

5. Claim 2 is objected to because its status identifier is "(original)" while the claim is 
being amended. Further, said claim recites the term "inforamtion" in line 3. Appropriate 
correction is required. 

Claim Rejections - 35 USC § 112 

6. The following is a quotation of the first paragraph of 35 U.S.C. 112: 

The specification shall contain a written description of the invention, and of the manner and process of making 
and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the art to which it 
pertains, or with which it is most nearly connected, to make and use the same and shall set forth the best mode 
contemplated by the inventor of carrying out his invention. 

7. Claims 1-8 and 25-34 are rejected under 35 U.S.C. 112, first paragraph, as failing to 
comply with the written description requirement. The claim(s) contains subject matter which 
was not described in the specification in such a way as to reasonably convey to one skilled in 
the relevant art that the inventor(s), at the time the application was filed, had possession of 
the claimed invention. NEW MATTER. 

8. Claim 1 recites the limitations of "extracting a first set of related elements..." and 
"extracting a second set of related elements...", wherein said limitations have not been found 
in the instant specification. 

Claim Rejections - 35 USC § 103 

9. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

10. This application currently names joint inventors. In considering patentability of the 
claims under 35 U.S.C. 103(a), the examiner presumes that the subject matter of the various 
claims was commonly owned at the time any inventions covered therein were made absent 
any evidence to the contrary. Applicant is advised of the obligation under 37 CFR 1.56 to 
point out the inventor and invention dates of each claim that was not commonly owned at the 
time a later invention was made in order for the examiner to consider the applicability of 35 
U.S.C. 103(c) and potential 35 U.S.C. 102(e), (f) or (g) prior art under 35 U.S.C. 103(a). 

11. Claims 1-8 and 25-34 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Yangarber et al. (2000) (Yangarber hereafter) in view of Soderland (1999). 
CLAIM INTERPRETATIONS 

12. In regard to the limitations "extracting a first set of related elements..." and "extracting a 
second set of related elements...", Yangarber describes the method of extracting information 
comprising "first element", "second element", "third element", etc. and a corpus of 5,963 
articles from the Wall Street Journal (page 284, column 1, last two lines, to column 2, line 
22). The description of the respective elements supports that the respective set of elements is 
different from each of the related elements. 

MOTIVATION TO COMBINE 

13. Yangarber discloses IE systems "for finding patterns automatically from un-annotated 
text" (page 282, Abstract etc.). Soderland, in turn, describes that the performance of WHISK is 
comparable to other IE systems and that, for structured or semi-structured text, high or even perfect 
accuracy can often be achieved from a modest amount of training examples (page 236 to 
page 238). Therefore, one of ordinary skill in the art at the time of the invention would have 
been motivated by Soderland to improve the system of Yangarber for high or even perfect 
accuracy. 

PRIOR ART 

14. In regard to claims 1 and 4, Yangarber discloses a computer-implemented method of 
extracting information from an information source comprising a plurality of documents (page 
283, column 2, lines 5-9, especially, "documents", and page 286, column 2, section 4.2, 
especially, "corpus of 100 documents"); 

Accessing strings of text in the information source (page 283, column 2, lines 10-17, 
especially, "patterns"); 
Comparing the strings of text in the information source with generalized extraction 
patterns and identifying a plurality of strings in the information source that match at 
least one generalized extraction pattern (page 283, column 1, lines 1-5, especially, 
"generalize them into patterns", and column 2, lines 32-34, especially, "Generalize 
each pattern..."), the generalized extraction patterns including words and wildcards 
(page 285, column 1, section 3.5, especially, "wild-cards"); 

Extracting a first set of related elements of text pertaining to a topic from a first string 
of the plurality of strings based on a corresponding set of related elements pertaining 
to the topic in the at least one generalized extraction pattern, the first string being 
associated with a first document in the plurality of documents (page 282, column 1, 
last 5 lines, and page 285, column 1, section 3.4, especially, "management 
succession" as "topic"); 

Extracting a second set of related elements of text pertaining to the topic from a 
second string of the plurality of strings based on the corresponding set of related elements 
in the at least one generalized extraction pattern, the second string being associated 
with a second document in the plurality of documents, wherein at least one of the 
related elements of text in the first set of related elements is different from each of the 
related elements of text in the second set of related elements of text (page 285, 
column 1, section 3.5, to column 2, last line); and 

Outputting the first related set of elements and the second set of related elements 
(page 285, column 1, last 3 lines). 
15. However, Yangarber does not explicitly describe that the wildcards denote that at least 
one word in an individual string can be skipped in order to match the individual string to an 
individual generalized extraction pattern. Soderland discloses that the wildcards denote that at 
least one word in an individual string can be skipped in order to match the individual string 
to an individual generalized extraction pattern (page 238, section 2.1, especially, "wildcard 
means to skip any number of characters..."). Therefore, it would have been obvious to 
one of ordinary skill in the art at the time of the invention to make and use the system of 
Yangarber with the wildcard feature described by Soderland for high or even perfect 
accuracy. 
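
The matching and extraction steps recited in paragraphs 14 and 15 can be illustrated with a minimal sketch. It is not taken from Yangarber, Soderland, or the instant application. It assumes a hypothetical pattern syntax in which a token such as `<company>` is a slot that captures one word and `*` is a wildcard that skips one or more words; the function names `compile_pattern` and `extract`, the slot names, and the sample strings are all illustrative assumptions.

```python
import re


def compile_pattern(pattern: str) -> re.Pattern:
    """Translate a hypothetical generalized pattern into a regular expression.

    Assumed token syntax (not from the cited references):
      "<name>"  slot that captures exactly one word
      "*"       wildcard that skips one or more words
      other     literal word that must appear as written
    """
    parts = []
    for token in pattern.split():
        if token == "*":
            parts.append(r"\S+(?:\s+\S+)*?")          # skip one or more words, lazily
        elif token.startswith("<") and token.endswith(">"):
            parts.append(rf"(?P<{token[1:-1]}>\S+)")  # named slot, one word
        else:
            parts.append(re.escape(token))            # literal word
    # (?<!\S) forces the match to begin at the start of a word.
    return re.compile(r"(?<!\S)" + r"\s+".join(parts))


def extract(pattern: str, documents: dict[str, str]) -> list[dict[str, str]]:
    """Return one set of related elements per document whose text matches."""
    regex = compile_pattern(pattern)
    results = []
    for doc_id, text in documents.items():
        match = regex.search(text)
        if match:
            results.append({"doc": doc_id, **match.groupdict()})
    return results


# Two strings from two different (made-up) documents.
documents = {
    "doc1": "Acme yesterday introduced RocketSkates",
    "doc2": "Globex has formally introduced Widget",
}
for row in extract("<company> * introduced <product>", documents):
    print(row)
```

In this sketch a single generalized pattern yields a different set of related elements from each document, and the wildcard is what lets the strings differ in the words between the company and the verb.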

16. In regard to claim 2, Yangarber describes processing the first related set of elements and 
the second set of related elements to analyze data in the information source (page 284, 
column 1, last two lines, to column 2, line 22). Therefore, it would have been obvious to one 
of ordinary skill in the art at the time of the invention to make and use the system of 
Yangarber with the wildcard feature described by Soderland for high or even perfect 
accuracy. 

17. In regard to claim 3, Yangarber describes all the limitations of said claims except for the 
limitation of "the generalized extraction patterns there is at least one word positioned 
between at least one of the elements and the indication." Soderland describes the limitation 
of "the generalized extraction patterns there is at least one word positioned between at least 
one of the elements and the indication" (page 238, section 2.1, to page 241, especially, 
Figures 6, 8, and 10), as required by the claimed invention. Therefore, it would have been 
obvious to one of ordinary skill in the art at the time of the invention to make and use the 
system of Yangarber with the wildcard feature described by Soderland for high or even 
perfect accuracy. 

18. In regard to claims 5-8, Yangarber and Soderland render the claimed invention obvious 
in light of the above cited prior art as a whole. Further, Soderland specifically describes that it is 
well known that IE systems such as WHISK comprise the claimed "computer readable 
medium" (page 261, section 6.1, especially, "database applications and software agent 
technology can be enabled by IE systems"). Therefore, it would have been obvious to one of 
ordinary skill in the art at the time of the invention to make and use the system of Yangarber 
with the wildcard feature described by Soderland for high or even perfect accuracy. 

19. In regard to claim 25, Yangarber describes that each of the elements of the first set of related 
elements of text is different from each of the elements of the second set of related elements 
of text (page 284, column 1, last two lines, to column 2, line 22). Therefore, it would have 
been obvious to one of ordinary skill in the art at the time of the invention to make and use 
the system of Yangarber with the wildcard feature described by Soderland for high or even 
perfect accuracy. 

20. In regard to claim 26, Yangarber describes that the corresponding related set of elements refers 
to general elements pertaining to the topic and that the first set of related elements and the second 
set of related elements refer to specific text associated with the general elements (page 284, 
column 1, last two lines, to page 285, column 1, section 3.4, especially the described elements, 
and generalization and concept classes). Therefore, it would have been obvious to one of 
ordinary skill in the art at the time of the invention to make and use the system of Yangarber 
with the wildcard feature described by Soderland for high or even perfect accuracy. 
21. In regard to claims 27 and 28, Yangarber describes all the limitations of said claims 
except for the limitation of "company/product pair." Soderland describes the limitation of 
"company/product pair" (page 235 in its entirety, especially, "Z Corp" and "maker of 
semiconductors") as required by the claimed invention. Therefore, it would have been 
obvious to one of ordinary skill in the art at the time of the invention to make and use the 
system of Yangarber with the wildcard feature described by Soderland for high or even 
perfect accuracy. 

22. In regard to claim 29, Yangarber describes "the plurality of documents include at least 
one of a collection of documents, news articles..." (page 284, column 1, last two lines, to 
column 2, line 22, especially, the Wall Street Journal). Therefore, it would have been 
obvious to one of ordinary skill in the art at the time of the invention to make and use the 
system of Yangarber with the wildcard feature described by Soderland for high or even 
perfect accuracy. 

23. In regard to claims 30-34, Yangarber and Soderland render the claimed invention 
obvious in light of the above cited prior art as a whole. Further, Soderland specifically 
describes that it is well known that IE systems such as WHISK comprise the claimed "computer 
readable medium" (page 261, section 6.1, especially, "database applications and software 
agent technology can be enabled by IE systems"). Therefore, it would have been obvious to 
one of ordinary skill in the art at the time of the invention to make and use the system of 
Yangarber with the wildcard feature described by Soderland for high or even perfect 
accuracy. 
CONCLUSION 

24. The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure: Hull et al. (1987) and Ravichandran et al. (2002). 

25. Applicant's amendment necessitated the new ground(s) of rejection presented in this 
Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP § 706.07(a). 
Applicant is reminded of the extension of time policy as set forth in 37 CFR 1.136(a). 

26. A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until 
after the end of the THREE-MONTH shortened statutory period, then the shortened statutory 
period will expire on the date the advisory action is mailed, and any extension fee pursuant to 
37 CFR 1.136(a) will be calculated from the mailing date of the advisory action. In no event, 
however, will the statutory period for reply expire later than SIX MONTHS from the date of 
this final action. 
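
The date arithmetic in paragraph 26 can be sketched as follows. This is a minimal illustration under stated assumptions, not an official computation: it takes the mailing date from the cover sheet, counts calendar months to the same day of the month (clamping to the end of a shorter month, which is an assumption), and does not model the advisory-action provision or any adjustment for weekends and federal holidays. The helper name `add_months` and the variable names are hypothetical.

```python
import calendar
from datetime import date


def add_months(start: date, months: int) -> date:
    """Return the date `months` calendar months after `start`.

    The day of the month is kept where possible and clamped to the last
    day of the target month otherwise (an assumption of this sketch).
    """
    index = start.month - 1 + months
    year = start.year + index // 12
    month = index % 12 + 1
    day = min(start.day, calendar.monthrange(year, month)[1])
    return date(year, month, day)


mailing_date = date(2006, 11, 13)                   # DATE MAILED on the cover sheet

early_reply_cutoff = add_months(mailing_date, 2)    # TWO MONTHS: first-reply window
shortened_period_end = add_months(mailing_date, 3)  # THREE MONTHS: shortened period
statutory_limit = add_months(mailing_date, 6)       # SIX MONTHS: outer limit

print("First reply within two months by:", early_reply_cutoff.isoformat())
print("Shortened statutory period ends: ", shortened_period_end.isoformat())
print("Latest possible reply date:      ", statutory_limit.isoformat())
```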

27. Patent applicants with problems or questions regarding electronic images that can be 
viewed in the Patent Application Information Retrieval system (PAIR) can now contact the 
USPTO's Patent Electronic Business Center (Patent EBC) for assistance. Representatives are 
available to answer your questions daily from 6 am to midnight (EST). The toll free number 
is (866) 217-9197. When calling please have your application serial or patent number, the 
type of document you are having an image problem with, the number of pages and the 
specific nature of the problem. The Patent Electronic Business Center will notify applicants 
of the resolution of the problem within 5-7 business days. Applicants can also check PAIR to 
confirm that the problem has been corrected. The USPTO's Patent Electronic Business 
Center is a complete service center supporting all patent business on the Internet. The 
USPTO's PAIR system provides Internet-based access to patent application status and history 
information. It also enables applicants to view the scanned images of their own application 
file folder(s) as well as general patent information available to the public. 

28. For all other customer support, please call the USPTO Call Center (UCC) at 800-786-9199. 
The USPTO's official fax number is 571-272-8300. 

29. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to C. Dune Ly, whose telephone number is (571) 272-0716. The 
examiner can normally be reached on Monday-Friday from 8 A.M. to 4 P.M. 

30. If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Tim Vo, can be reached on (571) 272-3642. 

C. Dune Ly 
Patent Examiner 